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ABSTRACT 

The statistical meaning of the local non-Gaussianity parameters /nl and ^nl is examined in detail. 
Their relations to the skewness and the kurtosis of the probability distribution of density fluctuations 
are shown to obey simple fitting formulae, accurate on galaxy-cluster scales. We argue that the 
knowledge of /nl and ^nl is insufficient for reconstructing a well-defined distribution of density 
fluctuations. However, by weakening the statistical significance of /nl and ^nLj it is possible to 
reconstruct a well-defined pdf by using a truncated Edgeworth series. We give some general guidelines 
on the use of such a series, noting in particular that 1) the Edgeworth series cannot represent models 
with nonzero /nl, unless ^nl is nonzero also, 2) the series cannot represent models with ^nl < 0, 
unless some higher-order non-Gaussianities are known. Finally, we apply the Edgeworth series to 
calculate the effects of ^nl on the abundances of massive clusters and large voids. We show that the 
abundance of voids may generally be more sensitive to high-order non-Gaussianities than the cluster 
abundance. 
Subject headings: Cosmology: theory - large-scale structure of universe. 



lsiri@astro.ox.ac.ukl 



1. INTRODUCTION 

One of the great challenges for 21st-century cosmology is to understand the statistical distribution of primordial 
seeds that eventually grew to become large-scale structures. If these primordial seeds, or density fluctuations, were laid 
down by a simple inflationa ry mechanism c onsistin g o f a sin g le sca lar field, the initial distribution is expected to be very 
close to Gaussian (see e.g. iBartolo et alj (|2004al) or iChenI ()2010D for reviews). However primordial non-Gau ssianity 



can be la rge in more complex models such as multi- field inflation ("Rigopoulo s et al" [2006'; 'Bvr nes fc Choill2010D , brane 
inflation (|Chenll2005t iLanglois e{~al\ \2008) or in the curvaton model (Bartol oeran i2004b: Sas aki et al.ll2006l ). 



Given a physical model that generates primordial density fluctuations, it has become common practice to calculate 
the 'local' type of non-Gaussianity parametrized by constants /nl and, less commonly, (/nl, deflned by the expansion 
of the non-linear Newtonian potential 

<I' = '^ + /NL(</''-(0'))+gNL0' + ..., (1) 

where (f> is a, Gaussian random held. Observational constraints on . fNL from t he cosmic microwave background (CMB) 
anisotropies are currently consistent with /nl = 32 ± 42 f2a)(K omatsu et al. 2010). Constraints on ^nl are much 
weaker, with |5nl| ^ 6 x 10^ reported bv lVielva fc Sanzl ()2010[ ). These limits should improve by at least an order of 
magnitude with results from the Planck satellitqj. 

But what do these numbers actually tell us about the distribution of density fluctuations? After all, non-Gaussianity 
is a property of probability density functions (pdf ) and thus it is important to understand exactly how /nl and ^nl 
relate to the statistics of the density fluctuations. At leading order, /nl and qnl are proportion al to the skewness and 
the excess kurtosis of the distribution of primordial density fluctuations (jDesiacaues et al1l2008[ ). But as we shall see in 
this paper, the knowledge of /nl and (7nl is insufficient for the reconstruction the distribution of density fluctuations 
at late times, as the corresponding pdf cannot be positive deflnite. 

It has been said that characterising non-Gaussianity is akin to characterising a "non-dog" . Whilst this is true in the 
sense that there are infinite possibilities of probability distributions that are not Gaussian, it is misleading because 
different types of non-Gaussianity can be systematically characterised, for instance, by the deviations in the moments 
or cumulants from the Gaussian values. This is, in fact, the idea behind the Edgeworth expansion, which expresses a 
wea kly non-Gaussian distribution as the Gaussian distribution multiplied by a Taylor series consisting of cumulants 
(see IBlinnikov fc Moessneij ()1998D for a review). 

There have been a number of works that use the Edgeworth series to study the distributio n of density ffuctuations 
(jBernardeau fc KofmanI [l995t iJuszkiewi cz et al.lll995t IAmendolall200d iLoVerde et al.|[2008() . However, these works 
invariably truncate the Edgeworth expansion at just a few terms. In calculations of the abundance of massive clusters, 
the truncated series is often implicitly assumed to remain valid far out into the exponential tail of the distribution. 
In our opinion, it is very difficult to judge the validity of such calculations without establishing first the accuracy of 
the truncated Edgeworth series rigorously. In this paper, we shall address this issue by quantifying how sensitive the 
abundances of rare objects are to changes in the truncation order. 

Many previous applications of the Edgeworth series also faced with the problem that the resulting pdf is negative in 
some region. This is often attributed to fact that there are an insufficient number of terms in the Edgeworth series. The 
conditions needed for the truncated Ed geworth expansions to be non-negative have been addressed in simple cases by 
a few works in the statistical literature (jDraper fc Tiernevlll972t iBalitskava fc Zolotuhinal[l988l : iJondeau fc Rockingeri 
l2001f ). In this paper, we shall investigate this problem numerically and show that the positivity of the pdf generally 
depends not only on the number of terms in the series, but also on the available information on higher-order moments. 
Wc shall present some general guidelines on how the Edgeworth expansion could be used to model a non-negative 
non-Gaussian pdf. 

Having developed the necessary background for proper use of the Edgeworth expansion, we shall apply it to study 
the effects of (Jnl on the number densities of massive clusters and large voids. 

2. THE PRIMORDIAL DENSITY FLUCTUATIONS 

First, let us introduce the necessary parameters which will allow us to describe the distribution of density fluctuations 
statistically. 

Let pc Pb, Pr, Pa be the time-dependent energy densities of cold dark matter, baryons, radiation and dark energy. 
Let pm — Pc + Pb- We define the density parameter for species i as 

n. ^ ^i(i^, (2) 

Pcrit 

where pcrit is the critical density defined by pcrit = SHq/SttG. The Hubble constant, Hq, is parametrized by the usual 
formula Hq = 100ft. km s~ Mpc~^. Results from a range of astrophysical ob servations are consist e nt with h c:^ 0.7, 
Oe ~ 0.23, rjfc ~ 0.046 and ^r ~ 8.6 x 10"^, with ^a = 1 - rj„, - Qr [see e.g. iKomatsu et~all ()2010[ ): iLahav fc Liddld 

The density fluctuation field, S, is defined as 



^ http://plaiick.cf.ac.uk 



c/ ,N _ Pm(x,t - {Pmjt)) , . 

'5X,t)= -— , (3 

{Pm[t)) 



where (pm) is the mean matter energy density. In Fourier space, the density fluctuation field can be decomposed as 



5(x,i) 



(2^)= 



6(k,t)e 



ik-x 



(4) 



As we shall be dealing mainly with observables measured at the present time, to, we simply write S(k) to mean S(k., to). 
The gravitational Newtonian potential $ is related to the density fluctuation by the cosmological Poisson equation. 
For a Fourier mode k, this reads 



5(k) 



Sflr 



k 

h'o 



$fk) 



(5) 



Statistical information on (5(x) can be deduced from that of S(k). The finite resolution of any observation, however, 
means that we can only empirically obtain information on the smoothed moments of the distribution. More precisely, 
given a length scale R, the smoothed density field, 6r, observed today is given by 



(5fl(k) == W{kR)D{0)T{k)6{k), 



(6) 



where k = |k| and -D(O) « 0.76 is the linear growth factor evaluated at 2: = 0. We choose W to be the spherical top-hat 
function of radius R. In Fourier space, we have 



W{kR) = 3 



sm{kR) cos{kR) 



(fci?)3 (fci?)2 

It is also useful to define the mass of matter enclosed by the top-hat window as 

M.l.i?3,„,.1.16xl0-(^)%-M,. 

We follow the approach outlined in [Weinberg ()2008[ ) and use the transfer function T of Dicus 



T{x) 



ln[l + (0.124a;)2 
(0.124a;)2 



1 + (1.257a;)2 + (0.4452a;)4 + (0.2197a;)^ 



1 + (1.606a;)2 -|- (0.8568a;)4 + (0.3927a;)6 



1/2 



(7) 



(8) 



(9) 



In addition, we also incorporate the baryonic correction of lEisenstein fc: Hul (|1998[ ), whereby the transfer function is 
evaluated at 



ka 



1/2 



XEH = 



HqQ,„i 



1-a 



1 + (0.43A:s)4 



(10) 



with 



and 



= 1 - 0.328\n{43m,nh^)—^ + 0.38ln{22.3n,nh^) ( —^ 



44.51n(9.83/an/i') ,. 
s = — _ _ , . Mpc. 



VT+ToowF^ 

The matter power spectrum, P{k), can be defined via the two-point correlation function in Fourier space as 

(,5(ki), <5(k2)) - (2^)35,,(ki + k2)P(fc), (11) 

where Sjj is the 3-dimensional Dirac delta function. In linear perturbation theory, it is usually assumed that inflation 
laid down an initial spectrum of the form fc"= , where n^ is the scalar spectral index (assumed to be 0.96 in this 
work). Physical processes which evolve P{k) through the various cosmological epochs can simply be condensed into 
the equation 



P(k) ex P^{k)T^{k), 
where Pip{k) ex fc"=^ . It is also common to deflne the dimensionless power spectrum V{k) as 

Vik) . ^^P,ik) oc (A 
Consequently, the variance of density fluctuations smoothed on scale R can be written as 



4 







- A\k)T{k), 



(12) 



(13) 



(14) 



where 



A{k) = 






Hu 



W{kR)D{0)T{xEii)- 



(15) 



In our numerical work, we shall normalise V{k) so that 

ag, = <7{R = 8h-^M-pc) = 0.8. (16) 

3. STATISTICAL INFORMATION IN /nl AND ^nl 

The most widely studied type of non-Gaussianity is the 'local' type parametrized, at lowest orders, by /nl and 5nl, 
which are the coefficients in the Taylor expansion of the non-linear Newtonian potential, $, in terms of the linear, 
Gaussian field, 6, 



$(x) = 0(x) + /nl (0'(x) - {4>^)) + .gNL0'(x) + . . 



(17) 



This form of non-Gaussianity arises in si mple models of single and multi-fie ld inflation (Bartolo ct al.' '20043 
Rigopoulos et al.l 120061 : iBvrnes fc Choil |2010[ ) as well as some curvaton models (jBartolo et al.. .2004h : .Sasaki et ahl 
2006[) . In this work, we shall assume non-Gaussianity o nly of this form. In gene ral, it is possible that non-Gaussianity 
may be n on-local. Mechanisms such as DBI inflation (jAlishahiha et al.l I2004D or inflation with a non-standard La- 
grangian (jArkani-Hamed et al.ll200^ iChen et al.ll2007|) are known to generate primarily non-local non-Gausssianity. 
We comment on these possibilities later, but leave a full investigation for future work. 

We adopt the 'large-scale-structure' convention in which $ is extrapolate d to z = 0. We also ta k e /nt, and j?n t, to be 
constant, although it is conceivable that they may be scale dependent (see iSefusatti et aP (|2009[ ): iCayon et all ()2010l ) 
for constraints on the 'running' of /nl)- In this section, we investigate how this form of non-Gaussianity is related to 
the reduced cumulants, Sn, defined by 



SniR) 



mc 



2n-2 ■ 



(18) 



where {S^)c is the nth cumulant. For a distribution with zero mean, the relationships between the first few cumulants 
and moments are 






(4>c 



(4) 



icj\ 



(19) 



Throughout this work we shall often make references to the skewness and kurtosis, which are defined respectively 
as {S^}/o'% and ('^|j)/o'|j. The excess kurtosis is defined as (4)/o'|j — 3, with 3 being the kurtosis of the Gaussian 
distribution. 

3.1. /nl 
At leading order, /nl is related to the skewness via the relation derived in lDesjacques et all (|2008l ) 



^%(i?)-/NL/ -^ A{k,)V{k,) -^A{k2)V{k2) dflAiks 

Jo «1 ^0 «^2 J-1 



1-^2 



P,p{k3 



PAk2) 



(20) 



where fc| = fc^ + fc| + 2/ifcifc2. Figure [T] (upper curve) shows the cumulant ^3 as a function of a^ in the mass range 
lO^'^ — 10^^ h^^Mpc. On these scales, the weak scale-dependence in S3 can be accurately fitted by a simple formula 

3.15 X 10-4 X /nl .„^. 



with sub-percent accuracy. This fitting formula offers an easy way to calculate the observational signatures of /nl 
without resorting to the integrals in (|2n|) . 

3.2. gNL 
If /nl = 0, we can similarly derive the leading-order relation between (7nl and the excess kurtosis 



a^Si{R)-- 




dki 



A{h)V{h) / d^i, / d0, A{ki) 



where 



1/2 



fc4 = (fci +kl + kl+ 2kik2Ql2 + 2^2^3823 + 2fcifc3ei3) ' , 



e. 



[(1 - /^?)(i - Mi) cos(0i - (i)j) + m^ij] 



1/2 



,P^{ki 



Pdh) 



(22) 

(23) 
(24) 
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Figure 1. Solid lines show the reduced cumulants S3 and ^4 for /j^jl = 1 and g^L = 1 as a function (t{M), with mass M ranging between 
lO^'* — 10^' ^ h~ ^Mp, . Hi gher values of /nl ^dcI g^L scale multiplicatively. The overlapping dashed lines show the fitting formulae given by 
equations II21I I and II25I I. 



The result of this integration is shown as the lower curve in figure [T] As before, we found a fitting formula 

5.53 X 10-*^ 



S4 



X 5NL 



^1.70 



(25) 



accurate on the same mass scale. 

In summary, we can easily emulate the effects of /nl and ^nl using the fitting formulae pip and (|25p without 
having to compute the multidimensional integrals (|20|) and ([22|) . Of course, these formulae depend on the choice the 
primordial power spectrum as well as the window and transfer functions. The fitting formulae are not expected to be 
very sensitive to the changes in any one these ingredients. 



4. NON-GAUSSIANITY BASED ON /nl AND ^nl ONLY 

Much effort has been placed into using the parameters /nl and ^nl to specify the deviation of the primordial 
distribution from Gaussianity. As shown in the previous section, these parameters correspond, at leading order, to 
deviations from Gaussianity in the 3rd and 4th moments of the distribution. It is important to examine if one can 
consistently parametrize a non-Gaussian distribution in this way without having to worry about deviations in the 
higher-order moments. We shall demonstrate that this cannot be the case. 

The objective here is to reconstruct the pdf of density fluctuations given a sequence of mom ents { an. = ((5p), n = 
0, 1, 2 . . .}, which, in practice, can be estimated from galaxy-survey data (ICappi fc Maurogordat o 1995; Kurokawa et al.l 
1199 9^. This is the classic moment problem, which has been studied in great detail, beginning with the pioneering work of 
Stieltjes in 1894 and Hamburger in 1920 (see Kjeldsen (1993) for a historical review). In general, there is no guarantee 
that the resulting pdf will be non-negative, or, indeed, that a solution exists at all. A useful theorem regarding the 
existence of a solution to the Hamburger moment problem, i.e. when the pdf is defined on (—00, cx)), is the following: 



Theorem (Existence of solution to the Hamburger moment problem), 
to moments of a non-negative pdf if and only if the determinants 



The sequence {a„, n == 0, 1, 2 . . .} corresponds 



Dr, 



are all non-negative. 



ao 


ai 


a2 . 


ai 


a2 


as . 


a2 


as 


a4 . 


a„ 


an-i-l 


an-l-2- 



■ On+l 

• an+2 

• a2n 



0,1,2. 



(26) 



See lShohat fc Tamarkiiil (|1963f ) or lAkheizeiJ ()1965[ ) for proof. 

Let us consider the standardised distribution of a; = Sji/ajf, so that aQ = l,ai = and a2 = 1. Let us also 
suppose (as is implicit in some previous works) that non-Gaussianity weakly manifests in the skewness and kurtosis 
only (therefore a^ is close to and 04 is close to 3). Higher moments are taken to be identical to those of the normal 



o'S, 



0.5 



-0.5 




-0.4 



-0.2 



0.2 0.4 



Figure 2. Regions in which the determinant Dn > [see theorem | |26| |]. The horizontal axis shows aS-j, (skewness) and the vertical axis 
shows a'^Si (excess kurtosis). The regions correspond to n = 5 (outermost ellipse) to n = 10 (innermost ellipse). The regions are nested, 
co-tangential and converge towards the origin as n — > oo. The latter fact implies that there is no well-defined non-Gaussian pdf that can 
be described by deviations in the skewness and kurtosis alone. 



distribution 



a,, 



(n - 
0, 



1)!! = l-3-5---(n- 1) 



odd 



(n> 5). 



(27) 



The expressions for Dn up to n = 6 are given below. For convenience, we write s = 0:3 (skewness) and k = a4 — 3 
(excess kurtosis). 

Dg = k - s2 + 2, 

Di = -k^ - 8k2 - 6k - aks^ - 245^ + s* + 12, 

D5 = k^ -f ISk* - eOk^ - 750k2 -f 360k - 45ks2 + 4S>k\^ - ISOOs^ + 105s'' + 288, 

L»6-=945k^ + 11025k* - SOlOOk^ - 324000k2 - 43200k- 132300ks2 + 31500kV - 5184003^ 



99225s* + 34560. 



Note that for the Gaussian distribution (s = k = 0), these DnS are all positive as expected. 

The condition _D„ > always describes a closed region in the (s, k) plane containing the origin and bounded by the 
curve Dn — 0. Figure ^ shows these regions for n = 5 (outermost ellipse) to n = 10 (innermost ellipse). To make a 
connection with later sections, we have labelled the axes as (uS'a, cr'^Si), where 



75*3 = s, a'^Si = k, 



(28) 



as can be easily shown using relation (|18 |) -(jl9 p (to avoid cluttering we sometimes write a to mean ur). As n increases, 
the region corresponding to Z)„ > becomes smaller. Interestingly, the regions for any two consecutive values of n 
are nested and co-tangential. One can continue inductively this way to find that as n — ^ (X), the ellipses converge to 
the origin, implying that there is no room for any deviation from Gaussianity. We conclude that deviations in the 
skewness and kurtosis alone cannot consistently parametrize a non-Gaussian pdf. 

The upshot of all this is that /nl and 5nl by themselves cannot completely describe a non-Gaussian pdf. Information 
on higher-order correlation must be available for the pdf to be well defined. 

5. THE EDGEWORTH EXPANSION 

As described in the Introduction, the Edgeworth expansion is a convenient way to express a weakly non-Gaussian 
pdf as a series comprising its cumulants. Suppose that we only have estimates on /nl and ijnLj and no higher-order 
non-Gaussianity. The result of the previous section shows that the resulting pdf cannot be non-negative. 

Nevertheless, this result only holds if we use an infinite number of cumulants in the reconstruction of the pdf. This 
is equivalent to having an infinite number terms in the Edgeworth expansion. In numerical implementations, however, 
one truncates the Edgeworth expansion after a finite number of terms. As we will see shortly, it now becomes possible 
to describe an entirely non- negative pdf with only /nl and (7nl , circumventing the result of the previous section. The 
disadvantage of the truncation is, of course, that the cumulants of the reconstructed pdf may not correspond exactly 
to those of the actual pdf, and thus the statistical significance of /nl and ^nl is somewhat weakened. For very short 
series of just a few terms, the interpretations of /nl as skewness and (/nl as excess kurtosis are especially dubious. 

Given information on a finite number of cumulants, we shall investigate the sensitivity of the resulting pdf to the 
number of terms in the Edgeworth expansion. This sensitivity has been alluded to by several works in the literature 



puszkiewicz et al]|1995l : IT7o Verde et al.ll2008l : lDesiacques et al.ll2008D . though we believe that our analysis goes beyond 
those works. In particular, we shall argue that the truncated series cannot be used to deduce results for negative ^nl, 
unless some higher-order non-Gaussianities are known. 

5.1. The Petrov development 

In this paper, we shall be using the form of the Edgeworth series given by IPetrovl ()1975[ ). who gave a method of 
calculating the Edgeworth series to arbitrarily high order. Given a non-Gaussian pdf with zero mean and variance cr|j, 
we can express its deviation from Gaussianity as a product of the normal distribution and a Taylor series in crji: 



p{dj,) = niSr) 






<^R 



where N{Sji) is the normal distribution 



N{Sii) 



1 



CTfl; v27r 
and the coefficients E^ in the Taylor series are given by 



cxp 



2^1 



{fc,„} 

where v = — . 



Hs+2M n ri 



5, 



m+2 



(to + 2)! 



(29) 



(30) 



(31) 



We now explain the various components of the coefficient (|3T|) . Firstly, the sum is taken over all distinct sets of 
non-negative integers {fcm}m=i satisfying the Diophantine equation 

ki+2k2 + ■.■ + sks^ s. (32) 

We also define 

r = ki+k2 + ... + ks. (33) 

Next, the function Hn{v) is the Hermite polynomial of degree n. They can be obtained by the Rodrigues' formula 

il„(.) = (-l)'V/^/^^ (e-V2) . (34) 

For example, Hq{v) = 1 and Hi{v) = v. Higher order polynomials can be easily obtained via the recurrence relation 

Hn+i{v) ^ vHn{v) - nHn-i{y). (35) 

Note that iip{5[i) is Gaussian, the cumulants of order > 3 vanish identically, and so do the expansion coefficients (|5T|). 
as one might expect. 

5.2. Validity of the truncated series 

The Edgeworth expansion takes, as input, a sequence of cumulants {Sn\ which are combined with polynomials 
of various degrees up to order N . Therefore, when using the Edgeworth expansion, there are two factors which will 
determine its accuracy, namely 1) the number of available cumulants, and 2) the order N . These two issues are separate 
in the sense that it is possible to expand the Edgeworth series to arbitrarily high order given a limited number of 
cumulants. Both these issues must be analysed to properly monitor the sources of error. 

5.2.1. Linear truncation 
When the Edgeworth series (j29p is truncated at linear order in gr, the resulting pdf is given by 



p{v) - N{v) 



6 



(1^3 _ 3^,) 



(36) 



where v = Sji/gr as before. Observe that if ^3 > 0, a sufficiently large negative v gives p{v) < and, similarly, if 
5*3 < 0, a sufficiently large positive v gives p{i') < 0. For instance, if |S'3| = 0.1, p(i/) becomes negative as early as 
\v\ ~ 3. This implies that a linear truncation of the Edgeworth series is highly suspect and is certainly not suitable 
for calculating, for instance, the mass function whereby high values of density fluctuations are involved. 
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Figure 3. Validity of tlie quadratic Edgeworth expansion I I37II . The shaded region corresponds to the combination of ^3 and 54 for which 
there exists a non- negative pdf over the entire real hne. See the text for further discussions and proof of the bound < a^Si < 4. 

5.2.2. Quadratic truncation 
The Edgeworth series truncated at quadratic order in a]:t yields 



p{v) = N{v) 



1 



6 



H^{v) 



Hiiiy) 



02 

72 



Heiiy) 



(37) 



We wish to determine the combination of S3 and S^ such that piv) is non-negative. The numerical evaluation of piv) 
over a grid of S3 and 5*4 is shown in figure [31 In the figure, we mark points for which p{h') > in the (crfl.S'3, crj^S^) 
plane, in the domain v e [—20, 20]. This reveals a closed region in which p{i^) > 0. In fact, the bounding envelope can 
be found analytically by setting p{i') = p' {v) = 0, but the resu lting equation has a very com plicated parametric form 
which we shall not show here. For details of this technique see lJondeau fc Rockingeij (|2001| ). 

A curious feature of figure [3] is that the excess kurtosis, cr^ iS'4, is limited to a small, non- negative range. We can 
prove this as follows. Setting S3 = 0, the quadratic series can be written as 



N{u) 



= 1 



24 



[{-' - i? - 6] 



(38) 



This expression clearly achieves the minimum when v^ — i = 0. Requiring the minimum to be non- negative establishes 
the upper bound cr^S'4 < 4. Next, if 6*4 < 0, the quartic expression is unbounded from below and so the pdf will be 
negative for some large x. Thus, we must have < cr^S'4 < 4 

The bound for crS'3 is more difficult to establish and we shall not go into the detail here. We simply note since an 
analytic expression describing the shaded region in figure [3] exists, the region in fact represents combinations of 6*3 and 
Si if^L and 5nl) for which the pdf is non- negative on the entire real line and not just in [-20,20]. For higher-order 
truncations, it becomes increasingly difficult to find such a region, as expected given the conclusion in section 2] 

5.2.3. Higher- order truncations 

Figure H] shows the same set of axes as figure [3] with the Edgeworth series now expanded up to terms of order a^ , 
cr^, (T^° and a'^'^ (top row to bottom row). In producing these figures, we have set the rest of the cumulants to zero. 
This is roughly equivalent to parametrizing the non-Gaussianity by /nl and (7nl only. Note that if n is odd, the series 
up to n terms performs significantly worse than one with even n. This is simply because odd (Hermite) polynomials 
are not positive definite, whereas even ones are, provided the coefficients are properly chosen. When scanning over a 
sufficiently large range of z/, an odd-ordered Edgeworth expansion will not produce any well-defined pdf whatsoever. 

The sensitivity of the regions to the range of v considered is clearly seen in the difference between the column on 
the left (in which p{i') is only required to be non-negative for \i/\ < 5) and on the right (|z/| < 20). As the range of ly 
increases, the cluster of points shrinks as it becomes increasingly difficult to find a closed region with p(i/) > 0. 

Observe that for ly S [—20,20], very few models with negative ^4 {i.e. t^NL < 0) are produced. In fact, if the range 
of v is sufficiently large, no models with negative 6*4 are produced at all. A simple explanation for this is as follows. If 
the highest non-zero cumulant of a non-Gaussian distribution is ^4, then, for large v, the Edgeworth series expanded 
to n terms is of order 5*4 j^". Hence, if ^4 < 0, a sufficiently large v will render the expansion negative regardless of the 
value of n. 

Therefore, it is necessary that higher-order non-Gaussianities are taken into account when modelling a non-Gaussian 
distribution with (7nl < 0. For instance, including nonzero cumulants 5*6 and Ss opens up the parameter space to 
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Figure 4. Validity of the Edgeworth expansion in the (pS-i,^ '^'^84) plane for u = 5/ a in the range [—5, 5] (column on the left) and [—20, 20] 
(right) with the series is expanded up to (from top to bottom) a^, a^, a^^ and a^^ . Only deviations in the skewness and the excess kurtosis 
are taken into account, with no higher-order non-Gaussianities. Points correspond to those combinations resulting in a non-negative pdf. 
See the text for further discussions on how the Edgeworth expansion may be properly used to produce non-negative pdfs. 
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8th cumulant, Ss, is positive, as explained in the text. 

those with 6*4 < 0, as shown in figure [S] 

In summary, the Edgeworth series should be expanded up to even order in an to produce a weU-defined pdf. The 
highest cumulant in that case is restricted to non-negative values. The Edgeworth expansion therefore can describe 
models with (7nl < if and only if cumulants of order at least 6 or higher are included. If (7nl = and non-Gaussianity 
is parametrized by /nl only, the Edgeworth expansion is odd-ordered and the resulting pdf is not well-defined. 

Although we have assumed that non-Gaussianity is characterised purely by the 'local' /nl and ^nl parameters, the 
results in this section (as summarised in figures [3][5|) have been established in terms of the cumulants, 5n, and so they 
hold even if there are other types of non-Gaussianity present. The only difference in this case is that it will be more 
complicated to translate the cumulants into /NL-type parameters. For instance, S3 will now comprise a mixture of 
local and non-local contributions 



5, 



/nl m + /nl -'-Si l^yj 

where Xi and X2 are some integral expressions. See iLoVerde et ahl ()2008[ ): [Desiacaues fc Seli"a3 (|20100 for the expres- 
sions for I2 in the case where non-Gaussianity is of the so-called folded or equilateral-triangle type. 



Having understood how to produce well-defined non-Gaussian pdfs using the Edgeworth expansion, we shall now 
look at two applications, namely, the non-Gaussian prediction for abundances of clusters and voids. In what follows, 
we shall focus on the case where /nl — and ^nl > 0. 

6. ABUNDANCE OF MASSIVE CLUSTERS 

Large-scale structures are sensitive to primordial non-Gaussianity on scales much smaller than the CMB (see 
iDes iacaues & Seljak (2Ql3) for a recent review). On these scales, non-Gaussianity can manifest in the changes in 
cluster number co unt and its redshift dependence (iLucchin fc M atarrcsc 1988; Robinson & Baker 2000. Tio Verde et al 



2008t I0gurill2009t ) as well as a scale-dependent halo bias (jDalal et al...2008: .Matarrcse fc Vcrdc..2008; .Wands fc Slosar 



20091) . In this work, we use the Edgeworth approach, in its correct formalism, together with Press-Schechter theory 
to study the effect of non-zero (/nl on the number density of massive clusters. Redshift dependence and the effects on 
the correlation function will be examined in a later publication. 

6.1. Press-Schechter theory 

Let n{M) be the number density of collapsed objects of mass above M. Press-Schechter theory (jPress fc Schechteil 
I1974D gives the differential number density of collapsed objects as 



dn 
dM 



' M dM 



S,/a{M) 



p{v, M)dv, 



(40) 



where p{v, M) is the pdf smoothed by a window function containing mass M and 6c ~ 1.686 is the threshold overdensity 
for spherical collapse. For a non-Gaussian pd0 , iGrossi et al.l ()2009f ) suggest that a good fit to N-body simulations 
can be obtained by using the Press-Schechter mass function modified by the replacement 



6c ^ 0.866 Sc: 



(41) 



^ We note that the r e are a number of o ther formalisms for calculating the non-Gaussian contributions to the mass function. See e.g. 
ID'Amico eFall | [20lOf ) : [Maegiore fc Hiottol f2009a . 2010. 2009b ). 
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Figure 6. Left: The differential number density dn/dM as a function of mass scale M for models with gjviL = 5 X 10^, 1 X 10® and 5 X 10® 
(/nL = 0)- The number density of massive clusters increases with 3nl. Right: Ratios between the non-Gaussian and Gaussian number 
densities. 



(see lMaggiore fc Riottol (|2009bD for a possible theoretical origin.) We make this replacement in our calculations. 

Figure m shows the changes in dn/dM for a range of non-Gaussian models with ^nl = 5 x 10^, 1 x 10^ and 5 x 10^ 
(/nl = in all cases). In these calculations, we keep the Edgeworth expansion up to 10 terms and check that p{i') > 
at least in the range v G [—20, 20]. Outside this range, the p(i^) is sufficiently small and the contribution to the cluster 
abundance on this mass scale is negligible (note that for the normal distribution, iV(20) ^ 10~^^). The values of 
^NL have been chosen to stay within the region of validity (see figure S]). In our case, we require < cr^S'4 < 0.6, 
corresponding roughly to < ^nl ^ 0(10*). 

The general effect of ^nl > is a boost in the number density of th e most massive objects , although a significant boost 
requires the magnitude of ^nl to exceed the CMB-derived bound of iVielva fc Sand (J2010I) . For instance, abundance of 
objects of mass ~ lO^^M© (corresponding to the most massive clusters) is increased by about 10% for .qnl = 5 x 1 0^. 
Nevertheless, values of 5'nl of this magnitude has recently been proposed by Enqvist et al. (jEnqvist et al.l [20101 ) to 
explain the observed excess of massive clusters. Until there is a larger compilation of massive clusters, the possibility 
of non-Gaussianity with (7nl ~ O(IO^) remains a viable. 

6.2. Sensitivity to truncation 

Increasing the number of terms in the Edgeworth expansion does not change the pdf drastically. However, because 
the mass function is extremely sensitive to the exponential tail of the distribution, it is imperative that one keeps as 
many terms as practically possible in the Edgeworth expansion. Exactly how many terms are required will depend on 
a combination of factors such as the range of scales of interest or the redshift at which the calculations are made. 

Figure [7] demonstrates the sensitivity of the Edgeworth expansion to the truncation order. The panel on the left 
shows the pdf for a distribution with ^nl = 5 x 10^ (with higher-order cumulants again equal 0) and R = 8/i~^Mpc. 
The various lines correspond to the number of terms in the Edgeworth expansion. For i> ^ C(l), the pdfs lie almost 
exactly on top of one another, diverging only at the tail ends. However, when dealing with extreme-mass objects, 
keeping just a few terms in the Edgeworth series is inadequate, as seen in the panel on the right. Here, the ratio of the 
non-Gaussian number density dn/dM and the Gaussian value can change by 10% as the number of terms increases 
from 3 to 5 on mass scales beyond lO^^M©. Comparing with figure HI we conclude that fitting the observed abundance 
of massive clusters using a low truncation order would lead to a spuriously high value of ^nl and vice versa. 

We note that is likely that the sensitivity to the truncation order could increase significantly with redshift. We shall 
address this issue in a forthcoming work. 

7. ABUNDANCE OF VOIDS 

Primordial non-Gaussianity also changes the abundance of underdense regions, i.e. cos mic voids. An estim ate of 
void abundance ca n be computed by a simple extension of the Press-Schechter formalism (jKamionkowski et al 



iBiswas et al.ll20ldl) , although there ar e more sophisticated methods based o n the void probabilitv function (jWhite 



200 



197 



or the eigenvalues of the tidal tensor ()Doroshkevich|[l"970l : iLam et al.ll2009[ ). Presently, we shall use the Press-Schechter 
approach with the Edgeworth expansion to calculate the effect of (^nl on the void abundance. 

A void can be defined a s an isolated region in w hich — 1 < 5 < 6y , where J„ is some th reshold underdensity. 
Simulations carried out bv iShandarin et al. (20061. P ark fc Led (120071) and iColberg et all ([200 8) suggest Sy « -0.8. 
Linearly extrapolating this value to z = using the fitting formula of IMo fc White! ( 19961) gives Sy = —2.75. 

Let Prob<5^ (x) be the probability that S < Sy at x. Since Prob<5„ = 1 — Prob>5^ , differentiating this expression 
with respect to R shows that the analog of equation HOI for voids can simply be obtained by the replacement Sc — >■ Sy 
and a change in the overall sign. 

Figure |8] shows the differential number density dn/dR for models in which ^nl = 5 x 10^, 1 x 10^ and 5 x 10® , 
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showing the factor of enhancement. Voids appear to respond much more sensitively to g^L compared to clusters (see figure [6ll. 

plotted against the smoothing scale R. By increasing f/NL, the void abundance is enhanced and responds much more 
sensitively than the cluster abundance (compare figures [5] and [S]). For example, in the extreme case where (/nl = 5 x 10^, 
at i? = 20 h^^Mpc [M « 10^^ H^^Mq), the enhancement in the differential abundance compared to the Gaussian 
prediction is roughly 10% for clusters, but as large as 60% for voids. This suggests that large voids may be a more 
sensitive probe of primordial non-Gaussianity than massive clusters, although a more careful calculation is needed to 
confirm this. 

Another interesting observation is that ^nl > enhances both cluster and void abundances. This is in contrast 
with the effect of /nl > Oi which enhances th e number density of clusters, but suppresses the number density of voids 
(|Lam et al.l 120091 : iKamionkowski et al.ll2009D . Comparing the abundances of clusters and voids may offer a way to 
probe any asymmetry in the distribution of density fluctuations. 



CONCLUSIONS 



The key results in this paper are as follows 



• We clarified the statistical meaning of the local non-Gaussianity parameters /nl and 5nl, which, at leading 
order, are proportional to the skcwness and excess kurtosis of the distribution of density fiuctuations. These 
relations are in the form of multi-dimensional integrals, which can be fitted by simple formulae (j21|) and ()25|) . 
They are accurate on the mass scale lO^'^ — 10^^ Mq. 

• We showed that the information in /nl and (/nl is insufficient for a reconstruction of the pdf of density fiuctua- 
tions. Using a theorem from the classical Hamburger moment problem, we showed that there is no positive pdf 
which deviates from Gaussianity only in the 3rd and 4th moments. 
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• We studied the truncated Edgeworth series, emphasising that in this representation, /nl and ^nl may not 
accurately reflect the skewness and excess kurtosis of the reconstructed pdf, especially for shorter truncations. 
We surveyed the skewness-kurtosis plane for regions of validity (i.e. where the pdf is non-negative) for various 
truncations of the Edgeworth series. We proved that the Edgeworth expansion can represent a non-negative pdf 
if it is truncated at even order in a, with the highest-order cumulant restricted to non-negative values. In terms 
of local non-Gaussianity, this means that the Edgeworth series cannot be used to represent models with nonzero 
/nl without considering nonzero ^nl also. It also means that models with (Jnl < are not representable by a 
truncated Edgeworth series unless the non-Gaussian deviation in the 6th moment (or higher) is known. 

• Working with a lOth-order Edgeworth series, we calculated the effects of g^i^ on the cluster number density, 
dn/dAI, using the Press-Schechter formalism (see figure [5]). The differential abundance of the most massive 
clusters can increase significantly if gNL ^ 10^. We cautioned that the deduced value of ^nl from large-scale 
structures may be spuriously high if the series is prematurely truncated. 

• Finally, we extended the Press-Schechter approach to compute the effects of ^nl on the abundance of large 
voids. The void number density is enhanced much more sensitively compared to clusters (figure [5]). This could 
be confirmed by a more sophisticated calculation (e.g. using correlation functions to calculate the void probability 
distribution). We shall address this issue in a future publication. 
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